Phase Transitions in Operational Risk 
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In this paper we explore the functional correlation approach to operational risk. We consider 
networks with heterogeneous a-priori conditional and unconditional failure probability. In the limit 
of sparse connectivity, self-consistent expressions for the dynamical evolution of order parameters 
are obtained. Under equilibrium conditions, expressions for the stationary states are also obtained. 
The consequences of the analytical theory developed are analyzed using phase diagrams. We find 
co-existence of operational and non-operational phases, much as in liquid-gas systems. Such systems 
are susceptible to discontinuous phase transitions from the operational to non-operational phase via 
catastrophic breakdown. We find this feature to be robust against variation of the microscopic 
modelling assumptions. 
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I. INTRODUCTION 

Management and mitigation of risk events are major 
concerns for banks. The goal is two-fold: first, quantita- 
tively assess the risk in terms of potential financial loss 
and second, develop solutions to control and buffer the 
impact of these losses. To facilitate systematic analysis, 
risk events are broadly classified into three categories: (i) 
Market Risk (MR), (ii) Credit Risk (CR) and (iii) Op- 
erational Risk (OR). MR refers to fluctuations in stock 
indices, changes in interest rates, foreign exchange pari- 
ties or commodity (e.g., gold, oil, etc) prices. CR refers 
to loan defaults when companies go bankrupt. Research 
on understanding risk and developing sophisticated mod- 
els has traditionally focused on MR and CR, while OR 
was initially subsumed under "other" , non credit or mar- 
ket risks. Subsequent spectacular catastrophes including 
the bankruptcy of the Orange County municipality, Cali- 
fornia, USA in 1994 pj] and the collapse of Baring Invest- 
ment Bank, London, United Kingdom in 1995 which 
were neither attributed to MRs nor CRs, helped establish 
OR as a risk category of its own. The Basel Committee 
for Banking Supervision (BCBS), an international reg- 
ulatory body, now stipulates that banks must explicitly 
reserve a portion of equity capital against OR. 

Note that the above listed three main risk categories 
are not intended to be exhaustive. Other risk categories 
exist (e.g., liquidity risk is an important category for 
bank management, or fiduciary and compliance risks, 
which arise from the judicial responsibilities of the banks' 
customers). Depending on circumstances, these may in- 
deed outweigh the importance of the three 'main' risk 
categories mentioned above. 

The Basel II document Q, released by BCBS in 2001 
and revised in 2005, is a guideline on banking regulation. 
Under Basel II, OR is defined as the risk of losses result- 
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ing from inadequate or failed internal processes, people 
and systems, or from external events. From a practical 
and management perspective it is reasonable to catego- 
rize ORs as events based on causes and specific effects. 
Possible categorizations, as described in Q, are: (i) hu- 
man processing errors, (ii) human decision errors, (iii) 
system (software or hardware) errors, (iv) process design 
errors, (v) fraud and theft and (vi) external damages. 

To quantify the capital that must be allocated for oper- 
ational failures, Basel II suggests three methodologies: (i) 
Basic Indicator Approach (BIA), (ii) Standardized Ap- 
proach (SA) and (iii) Advanced Measurement Approach 
(AMA). Under BIA, the required capital is determined 
by taking 15% of the banks' average gross income over 
the previous three years. The SA is only slightly more 
advanced in that the banks' operations are divided into 
8 business lines, each of which has a specific weight. The 
required capital is calculated as the weighted average of 
the (non-negative) gross income from the business lines 
over the previous three years. Under AMA, banks are re- 
sponsible for designing their own measurement approach 
and setting assumptions on the loss distribution. How- 
ever, banks must demonstrate that their approach cap- 
tures potentially severe "tail" loss events. The use of ex- 
ternal and internal loss data as well as internal expertise 
is permitted in the evaluation. 

Under AMA, the Loss Distribution Approach (LDA) is 
a sophisticated and popular measure. Herein the rejjuired 
capital is determined by the Value-at-Risk (VaR) 2] over 
all OR categories. VaR is defined, over a specified risk 
horizon T as the loss not exceeded with probability q, in 
excess of the expected loss that can happen under normal 
economic conditions. The VaR is thus dependent on the 
nature of the loss frequency and severity distributions. 

Choices for the loss severity distribution function in- 
clude log-normal. Gamma, Beta and WeibuU distribu- 
tions. A Poisson or negative binomial distribution is of- 
ten used for the loss frequency distribution. The common 
approach to estimate the loss distribution is to first as- 
sume that OR categories are independent. Subsequently, 
for each category z, one draws a realization Ni from the 
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loss frequency distribution and samples Ni realizations 
of the loss severity X™ (m = 1, . . . , Ni). The loss is then 
calculated as 

u^Y. ■ (1) 

?n — 1 

Finally, drawing a histogram of outcomes of normalized 
Li one obtains the loss distribution for each category. 
The capital to be allocated is the VaR of the sum of losses 
over all categories, given a time horizon T and confidence 
level q. 

Subsequent developments in modelling OR have fo- 
cused on incorporating more realistic loss frequency and 
severity distributions. In _5|| Chernobai and Rachev pro- 
vide evidence in favor of using Stable Paretian Distri- 
butions. Similarly, in [6] Allen et. al., incorporate a 
Generalized Parcto Distribution. 

A critical point concerning LDA and subsequent an- 
alytical development is the assumption of independence 
between various OR categories. However, a moment's re- 
flection will lead us to realize that these categories, which 
may be conceptualized as sets of processes, are function- 
ally dependent. 

To put this in context we illustrate it with the collapse 
of Baring Investment Bank. A trader in the banks' Singa- 
pore office hid trading losses by forging official documents 
(trading decision errors resulting in fraud), thereby al- 
lowing discrepancies to go un-noticed by higher officials. 
Pleased with his apparent performance, the bank made 
a mistake of retaining him as Chief Trader (process de- 
sign error in light of the bank not having adequate checks 
and balances, which led to decision errors by the banks' 
managers). In December 1994, compounded by increas- 
ing losses and in order to overcome the predicament, the 
trader bet on the Tokyo Stock Exchange, assuming that 
the index would not fall below 19,000. However, in Jan- 
uary 1995 a devastating earthquake hit the Japanese city 
of Kobe, resulting in the index to plummet by 7% in one 
week, and below 19,000 (scries of decision errors com- 
pounded by external damage). By the time the bank 
discovered what had happened, the trader had lost $1.3 
billion, effectively bankrupting the bank. 

This example shows that the assumption of indepen- 
dence between processes is not realistic. Recent efforts 
have therefore been directed to capture the inffuence of 
interactions between processes. A model based on the 
lattice gas analogy was proposed in Q. Investigations 
were conducted via Monte Carlo (MC) simulations and 
using a mean-field approximation for a homogeneous and 
fully connected process network. The model parameters 
were shown to be related to both conditional and uncon- 
ditional failure probabilities. Furthermore, avalanches of 
process failures were shown to be possible through bubble 
nucleation as in first order phase transition. This model 
was subsequently elaborated by Leipold and Vanini P. 
In Clemente and Ramano present a case study, sub- 
stantiated by MC simulations, incorporating realistic de- 
pendences between processes. 



At this point, it is important to note that the concept 
of OR is not restricted to the banking industry, but is 
also systemic to any large economy or commerce. Over 
the past few decades, markets have been subject to con- 
siderable de-regulation and globalization. These forces, 
coupled with an increasing reliance on sophisticated in- 
formation technology have allowed businesses to develop 
more efficient operational practices, which include Busi- 
ness Processes Outsourcing (BPO) and automated data 
collection, storage and retrieval techniques. These ad- 
vances have led to increased mutual dependencies be- 
tween economic processes, resulting in a heightened sus- 
ceptibility for catastrophic breakdowns and thus signif- 
icant financial losses. A through understanding of the 
dynamics of interacting process networks is more than 
ever desirable. 

Therefore, in consideration of the above, the aim of the 
present paper is twofold: (i) provide detailed analytical 
underpinning for the main findings of 0] , and (ii) in doing 
so, broaden the scope of that investigation to highlight 
the fact that the possibility of catastrophic breakdown in 
networks of interacting processes is a robust phenomenon 
under variation of the underlying model assumptions. 

The remainder of the paper is organized as follows. 
In Sec. [H] we introduce our model. Sec. IIIII be- 
gins by introducing three distinct techniques, covering 
non-equilibrium dynamics and equilibrium statistical me- 
chanics, which we use to derive dynamical evolution and 
stationary state solutions. In Sec. IIII Al we detail a 
heuristic solution, valid in the case of asymmetric, un- 
correlated interactions between processes. In Sec. IIII Bl 
a more systematic study, based on a Generating Func- 
tional Analysis (GFA) and valid for an arbitrary degree 
of interaction symmetry is provided. In Sec. IIII CI we 
investigate the stationary behavior of a fully symmet- 
ric network using techniques from equilibrium statistical 
mechanics. In Sec. IIVI we produce phase diagrams for 
process networks, exhibiting regions in parameter space 
where operational phases coexist with non-operational 
phases. We subsequently evaluate a loss distribution and 
thereby extend the scope of LDA to account for correla- 
tions between processes. Finally, in Sec. |V] we provide 
concluding remarks and describe possible extensions for 
further work. Some of the more technical details from 
GFA and analysis of stationary states are relegated to 
appendices |X] and |B1 respectively. 



II. MODEL DEFINITIONS 

In this section we describe the statistical model of in- 
teracting processes, which was previously introduced in 
J,]. Each process i {i = 1, ... iV) is defined by its state 
at time i, which is referred to as r]i{t). A two-state model 
is considered: ■qi{t) G {0,1}. At time t a process can 
be either up and running, rji{t) — 0, or broken down, 
'qi{t) = 1. In order to maintain a stable running state 
over the time increment t t + 1^ each process i needs 
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to receive support at time t, which is denoted Ui{t) and 
takes the form 



(2) 



Here, S R denotes the average support received by a 
process in a fuUy functional environment, while S R 
represents the impact on the average support if process j 
breaks down. Finally, the S,i{t) are zero mean, Gaussian 
random fluctuations, which represent non-systematic in- 
ternal and external perturbations to the environment 
(e.g., fire, earthquake, voltage fluctuations within elec- 
tric supphes, etc.). 

A process breaks down in the next time step if Ui (t) < 
0. Thus, the dynamics of a processes' state is given by 



(3) 



where Q{. . .) represents the Heaviside function. The 
time-step, is taken to represent one day. 

By suitable rescaling of the average support and im- 
pact parameters "di and Jij, respectively, the ^i{t) can 
be taken to have unit variance. The rescaled variables, 
can then be related to the unconditional and conditional 
failure probabilities. Consider a situation wherein all pro- 
cesses are working at time-step t. The probability that 
process i will break down in the next step, referred to as 
Pi, is given by integrating Eq. ^ over the noise £,i{t). 
Thus 



P. = * {-^^) , 



(4) 



where = 5 (^1 + ^^^(:^))- Similarly, defining 

Pi\j as the probability process i will break down in the 
next step, given that currently process j is broken down 
while all others are working, gives us 



These relations may be inverted to obtain expressions for 
the model parameters in terms of the failure probabilities, 

- -<f-i(ft), J^l = *"'fe:b) - (6) 

In general, this model is not analytically tractable. In 
Q investigations were conducted via MC simulations and 
using a mean-field approximation for a homogeneously 
connected network with uniform conditional and uncon- 
ditional failure probabilities. In what follows, we study 
and solve the model in a more interesting regime where 
conditional and unconditional failure probabilities are 
heterogeneous across the system. 

We begin by noting that if Jij < 0, the breakdown 
of process j adds support and is beneficial to process i, 
i.e., the two processes are competing. Such a situation 
is undesirable, but tends to occur on a small scale in 
large organizations. Thus, when considering a primarily 



cooperative environment, it is desireable to have, with 
high probability, Jij > 0. 

In our framework, each process does not interact with 
all, but instead a fraction of the other processes. We 
explicitly incorporate this feature by decomposing 



Ji' 



^ij Jij 1 



(7) 



where Cij € {0,1} are connectivity coefficients and 
Jij € R describes the magnitude of impact. We assume 
Cij = Cji. While this is a reasonable assumption, it is 
important to realize that the impact magnitudes are not 
necessarily symmetric. For example, consider a main- 
frame computer connected to a dummy terminal com- 
puter. If the main-frame crashes, we cannot use the ter- 
minal. However, if the terminal computer breaks down, 
it is highly unlikely that it would effect the operations 
of the main-frame. Thus, in general, Jij ^ Jji. The 
connectivity coefficients are described by the following 
distribution 



1 



N 



N 



(8) 



where c G is the average connectivity per process. 
In the case of finite N , the adjacency matrix describes 
an Erdos-Renyi random graph [10]. 

In what follows we consider the extreme dilution limit, 
wherein N — > 00, c — > 00, such that c/N 0. This 
is achieved by taking c = C'(log(iV)). This assumption 
has important implications on the structure of the con- 
nectivity graph. Firstly, each node in the graph will be 
connected to a vanishing fraction of the total number of 
nodes. Secondly, the length of a loop is C'(log(A^)) [ll|- 
Thus, taking N — > 00, the probability of finding loops 
of finite length tends to 0. The environment about each 
node is locally tree-like. 

The magnitude of the impact parameters Jij are taken 
to be quenched, i.e., fixed random quantities. To allow 
for the thermodynamic limit iV — > 00 and c — > 00 the 



^5^ mean and variance of the Jij must scale with c. We put 



c 



J 



(9) 



where the Xij are zero mean and unit variance random 
variables. The mean and variance of Jij are parameter- 
ized by Jo G R and J G R respectively. We note that if 
Jo > the interactions between processes are, on aver- 
age, supportive, which is the regime of interest. Secondly 
small J suppresses the probability of having of negative 
Jij. In addition, small J reduces the effects of frustra- 
tion [T^ . Finally, we choose the Xij to be independent in 
pairs and have the following moments 

x~ = Xij xki = (5[i,fc] -I- a5[,^i] 5[j^^ . (10) 

The parameter a G [0, 1] describes the degree of corre- 
lations between J^ and Jji, with fully symmetric inter- 
actions given by a = 1. 
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III. MODEL SOLUTIONS 

Here we investigate the dynamics and the stationary 
states of the model introduced in Sec. [Til 

For uncorrelated interactions, i.e., a = 0, a solution 
can be obtained by a heuristic argument, following lines 
of reasoning previously used to study statistical mechan- 
ics of disordered systems, in particular neural networks 
This solution relies on assuming weak and neg- 
ligible correlations between the quenched and dynamic 
random variables, which cannot be easily justified in a 
rigorous manner. 

A more exact and formal treatment of the dynamics is 
possible using generating functional analysis (GFA) [l^ . 
This technique facilitates the evaluation of order parame- 
ters ab initio and is applicable for arbitrary degree of cor- 
relation, a, between processes. In addition, it provides 
a non-trivial check to show that the heuristic solution is 
exact. 

Finally, for fully symmetric networks, a = 1, we study 
the stationary states using techniques from equilibrium 
statistical mechanics. This requires the use of thermal 
instead of Gaussian noise. 



A. Heuristic Solution 

We begin by observing that interactions of process i 
are described by the local field hi{t) = J2j '^ij J^j 
The state rii{t + 1) depends on the rij{t) that contribute 
to hi{t). Furthermore, the states r]j{t) depend on r]i{t—l) 
through hj{t—l). There is feed-back in the system, which 
induces correlations between the local fields at time t. 

In the present mean-field approach we evaluate the 
statistics of hi(t) by appealing to the law of large num- 
bers and central limit theorem. The applicability of these 
tools relies on assuming that the contributions to hi{t) 
are weakly correlated if not independent. The condition 
a = together with the limit of sparse connectivity, 
c/N — + 0, entail that contributions to hi{t) are uncorre- 
lated at finite times. This allows us to describe hi(t) as 
a Gaussian random quantity. 

Concentrating on the local field hi{t), we incorporate 
the definition of Jy given by Eq. ([9]) to rewrite the ex- 
pression as 

hzit) = —Y] Cij rij{t) + ^ V Cy Xij T]j{t) . (11) 
j j 

In the limit — > oo, the mean and variance of hi{t) are 
given by 



{hi{t)) 



and 



N 



(12) 



(13) 



respectively, [Tl|. The angled brackets ((•..)) refer to 
the average over noise terms, ^i(i), while the over-bar, 
(. . .) refers to the average over the coupling param- 
eters, Cij and Xij. 

In deriving these results, one assumes that correla- 
tions between dynamical degrees of freedom rii{t) and 
the Xij and Cij that characterize disorder, are negli- 
gible. This allows factoring of averages of the form 
CriXi, {r]i{t)) ~ CiTxiT^'^ '"^'^ (mXt)) The frac- 



tion m{t + 1) of failed processes at the next time-step, 
t + 1, evaluates to 



= ^Eq(^^-w + (14) 



In the limit A^ ^ oo, Eq. (|M1) can be evaluated by ap- 
pealing to the law of large numbers as a sum of averages 
over the random variables hj{t), ^j{t) and i)j. As hj{t) 
and (t) are Gaussian random variables, their sum is also 
Gaussian, with mean jQm{t) and variance 1 -I- J^m{i). 
Thus, first performing the joint average over local fields 
and noise, we obtain 



r7i(i 



1) = Ly^ 



Jo m(t) - 

^1 -I- J2 ^(i) 



(15) 



Finally, we note that the only i dependence in Eq. (fTS]) 
comes from the t?i. Thus, in the limit A^ — *■ oo, we obtain 
an average over the distribution (2^ . which we denote 



by (...) , giving 



TO(t + 1) 



Jo m(i) - 1? 
^1 + J2m(<) 



(16) 



We have thus obtained a simple closed expression for 
the evolution of the faction of failed processes in the net- 
work. 



B. Generating Functional Analysis 

Investigations of the dynamical properties of a system 
are conducted systematically employing GFA [l3| , which 
provides tools for the evaluation of correlation and re- 
sponse functions in terms of a characteristic functional 
of path-probabilities. Performing the average over bond 
disorder in the sum over dynamical trajectories, one ob- 
tains a family of effective single site processes parameter- 
ized by I?, 



s<t 

+ (Pit) + h{t)y (17) 
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These single site processes exhibit memory, via the re- 
sponse function, G{t, s), and are driven by colored Gaus- 
sian noise, {(f>{t)}, self consistently determined via 

J^q{s,t), 



where, 



(mm - <5[,,i] 



m{t) = (77(t)) 
q{s,t) 
G{t,s) 



drn{t) 



dh{s) 



s < t. 



(18) 
(19) 
(20) 
(21) 



Here ((. . .)) refers to average over {(j){t)}. The external 
field h(t) is primarily introduced to define the response 
function via Eq. (|2ip . The derivation and interpretation 
of these parameters are provided in appendix |^ The 
order parameter m(t), equation (|19p . describes the dy- 
namics of the fraction of failed processes. 

In the case a = 0, we recover Eq. (jl6p . which was de- 
rived via purely heuristic reasoning in Sec. IIII Al How- 
ever, for arbitrary a, the response function complicates 
averaging over {(/>(t)}. In these cases, numerical results 
for Eqs. P^ - (PT]) are obtained using Eissfeller-Opper 
(EO) simulations 15]. A key concern in performing the 
simulations is producing colored noise. Our method uses 
a Cholesky decomposition [l^ of the noise covariance ma- 
trix. We construct colored noise as a linear combination 
of white noises weighted with elements of the Cholesky 
matrix. 



C. Equilibrium Statistical Mechanics 

A process network of the form described in Sec. [Til is 
guaranteed to achieve a stationary probability distribu- 
tion for its microscopic states, if we assume a = 1 and 
use thermal noise, distributed according to 



1 



/3e 



(22) 



instead of Gaussian noise, as in Eq. ([3]). The parameter (3 
is called the inverse temperature. The Gibbs-Boltzmann 
equilibrium distribution is characterized by the Hamilto- 
nian, 

Hiv) = - X! + ^-^im- (23) 

i<j i 



A solution of this model, with disorder variables 
Cij and Xij, given by Eqs. ([7))- piI| . is achieved with 
techniques used to solve the Sherrington-Kirkpatrick 
(SK) spin-glass [l3| model. In this spirit, the Replica- 
Symmetric (RS) order parameters are obtained from the 
extensive free energy as solutions of the following pair of 
self-consistency equations. 



m = y X>z$^(/iRs) , 

7 " 

q = J Vz^ihns) , 



(24) 
(25) 



$^(a;) 



tanh 



and 



/iRS = —I? + Jom + J y^Z 



(3x 
~2~ 



(26) 



{m-q). (27) 



Derivations are provided in appendix |B] The order pa- 
rameter m, Eq. (p4|) . describes the stationary fraction of 
failed processes. 

The stability of this solution against Replica Symme- 
try Breaking (RSB) was checked by verifying that the 
Hessian at the RS saddle point is positive definite p^ . 
This requires that the so called replicon eigenvalue, given 

by 



X = l- iPJf Jvz ($^(/iRs) - <^>(,{hKS? 



(28) 



is positive. We found that for networks with parameter 
settings as investigated in section ITVl below, i.e., a-priori 
homogeneous failure probability, p = 0.01, RSB occurs 
only for J > 4.630. Hence, the regime of interest with 
small frustration, say J < 0.5, is well within the stable 
domain. 

Comparison between the equilibrium m, Eqs. (j24p - 
(P7| . and the long-term stationary behavior of m{t), Eq. 
([TS]). with a = 1 is possible, once we scale the ther- 
mal noise appropriately to match properties of the Gaus- 
sian noise. In principle, such a matching may be accom- 
plished in several different manners, each of which leads 
to similar results. Here, we prescribe that the thermal 
noise should have the same unit variance as the Gaus- 
sian noise used in the microscopic dynamics. This leads 
to /? = 7^/^/3. 



IV. 



RESULTS 



In this section we explore the consequences of theory 
developed above, with particular emphasis on phase dia- 
grams of resulting stationary states. 

For simplicity, we assume homogeneous i?, which al- 
lows us to drop the corresponding i?-averages. We justify 
this simplification by noting that any other non-trivial 
^^-distribution, for example Gaussian with mean and 
variance cr^, would not alter the qualitative behavior of 
the observables. We illustrate this for the fully asym- 
metric network, a — 0. To evaluate Eq. we exploit 
that the sum of Gaussian terms, hj{t) — dj + £,j{t), is 



itself a Gaussian with mean Jq m{t) 
1 + J"^ m{t) + cr|. This gives 



m{t -I- 1) = $ 



Jo m(<) - -d 



+ J2 m{t) + 



d and variance 



(29) 
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Up to a shift in the variance of the noise, this equation de- 
scribes the evolution of a system of asymmetrically cou- 
pled processes with uniform 'di = d. 

For stationary states for the fully symmetric network, 
a = 1, obtained from equilibrium statistical mechanics, 
a similar structural shift can be shown to apply. 

A. Fully Asymmetric Network 

Assuming long-term stationary behavior for the ob- 
servable ni{t), Eq. (jl6p . we drop the time index. The 
curve in the first panel of Fig. [T]dcpicts m as a function of 
Jq. We note the coexistence of a low-m operational and 
a high-m non-operational phase for Jq values bounded 
by the lower and upper critical values, J§ « 3.971 and 
Jq « 14.814, respectively. Within this interval there 
is an unstable branch of solutions, with intermediate 
m-values, represented by the back-bending part of the 
curve. 

The behavior is parameterized by the unconditional 
failure probability p. This dependence is made explicit 
in the phase diagram, shown in the second panel of Fig. 
[TJ The upper curve marks the Jp-boundary of the low- 
m operational phase, while the lower curve represents 
the Jg-bound of the high-m non-operational phase. A 
transition between the two phases is discontinuous for 
p < Pc ~ 0.102 and becomes continuous exactly at pc- 
Beyond Pc the phases loose their separate identity, much 
as in a liquid-gas system. In the region of small J, which 
is of interest for OR, say J < 0.5, the behavior shown in 
Fig. [1] is fairly insensitive to variation in J. 

B. Partially Symmetric Network 

Investigations of partially symmetric networks were 
conducted via: (i) evaluating the first three time-steps of 
the effective single site dynamics exactly, starting with 
random initial conditions, given by to(0), and (ii) con- 
ducting EO simulations of the dynamics to higher time- 
steps. 

The results presented in Fig. [21 were produced for an 
intermediate value of the symmetry parameter a = 0.5. 

The exact evaluation of the first three time-steps is 
depicted in the first panel. The curves represent the 
fraction of failed processes at each time as a function 
of Jo- We observe that these curves intersect the line 
m(Q) — 0.3 at exactly the same point, « 6.001, which 
marks a change in behavior. For Jq < Jq , a conver- 
gence to the low-m operational phase is realized, while 
for Jo > Jq , we observe convergence to the high-m non- 
operational phase. Except in the immediate vicinity of 
Jq , the convergence is rapid and achieved within the first 
three time-steps. 

To investigate the behavior beyond the first three time- 
steps, we used EO simulations to propagate Eqs. ITTl) - 
(PT|) up to t = 30. This time horizon is sufficient to 
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FIG. 1: First panel: Stationary observable m as a function of 
Jo. The homogeneous unconditional failure probability is p = 
0.01. Second panel: Phase diagram showing critical values of 
Jo, as a function of p, where the operational (upper curve) 
and non-operational (lower curve) phases become unstable. 
In producing both diagrams, we set J — 0.2. 



achieve stationarity, except in an infinitesimal neighbor- 
hood of the critical Jq values. In the second panel of 
Fig. [2] we provide the stationary m value as a function 
of Jq. The coexistence of the low-m operational phase 
and high-m non-operational phase is observed for inter- 
mediate Jq, in complete analogy to the fully asymmetric 
case. These Jq values are bounded between the critical 
values, Jq « 3.966 and Jq « 14.513. Here, the unsta- 
ble branch, represented by the back-bending curve, is 
computed by locating the Jq values that separate the 
regions of convergence towards either the operational or 
non-operational phase. It is noteworthy that we obtain 
an almost identical unstable branch if we use the exact 
forms of the first three time-steps. 
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FIG. 2: First panel: First three time steps of m(t) for a — 0.5 
and initial condition m(0) = 0.3, as a function of Jo. The 
curves m(l), m(2) and m(3) are given in order of increasing 
steepness about their intersection with m(0). Second panel: 
Stationary value of m as a function of Jo from EO simulations, 
with a = 0.5 and p = 0.01. 
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FIG. 3: First panel: Comparisons between stationary values 
of m for a = 1.0 for networks with Gaussian noise (solid line) 
to networks with thermal noise (dotted line), as functions 
of Jo. The inverse temperature was /3 — 7r/\/3, and the a- 
priori homogeneous failure probability is p = 0.01. Second 
panel: Phase diagram of the network with thermal noise. In 
producing both diagrams, we set J = 0.2. 



C. Fully Symmetric Network 

Investigations of fully symmetric networks, a = 1, were 
conducted via: (i) appealing to equilibrium statistical 
mechanics, Eqs. ([2^ - (P71) . and (ii) EO simulations. 

To compare the two results, we scaled the thermal 
noise to have unit variance, resulting in /3 = Tr/y/S. 

In the first panel of Fig. [3l we plot values of m as a 
function of Jq. The dotted line was produced using tech- 
niques from equilibrium statistical mechanics. The solid 
line was produced from EO simulations for a = 1. In 
both cases, the coexistence of a low-m operational and 
a high-m non-operational phase, for intermediate values 
of Jo, is observed. The coexistence regions are given by 
the intervals [4.113, 20.478] and [3.966, 14.512] for the 
networks with thermal and Gaussian noise, respectively. 
Whereas the critical values for the non-operational phase 
are almost identical, a significant discrepancy for the up- 
per critical value is observed. This discrepancy must be 



attributed to differences in the nature of the noises. 

In the second panel of Fig. [H we produce the phase 
diagram for the system with thermal noise. We observe a 
similar qualitative behavior to the fully asymmetric case, 
a — Q, given in Fig. [1] The critical point Pc ~ 0.110 is 
very close to that from the asymmetric case, Pc ~ 0.102. 



D. Loss Distribution and Capital Requirement 

Finally, in Fig. [4] we show a loss distribution for a net- 
work of 50 processes. The stationary macroscopic vari- 
able m represents the fraction of failed processes. Al- 
ternatively, m represents the probability that a process 
breaks down within an interacting environment. That is, 
effects of functional correlations with other processes in 
the network have been accounted. We proceed by draw- 
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FIG. 4: Loss distribution for a network of 50 processes con- 
structed using the LDA. A binomial distribution, with mean 
m = 0.01, was used for the loss frequency distribution. A log- 
normal distribution, with mean fi — 5 and variance — 75, 
was used for the loss severity distribution. 



ing from a binomial distribution, for each process i the 
number ki of days in a year the process fails. The daily 
failure probability is given by to. Failures on different 
days are taken to be independent. Subsequently bin- 
ning the sum of ki realizations from the log-normal loss 
severity distribution for each process, we obtain the loss 
distribution for the network. 

In the thermodynamic limit one would expect the loss 
distribution to be a Gaussian by the central limit theo- 
rem. However, for TV = 50 the distribution is still heavily 
skewed to the right with a "fat" tail and rare realizations 
of extremely large losses. 

A qualitative study was also conducted to determine 
the effects of functional correlations on the VaR. Recall 
that under LDA, VaR is used to determine the capital 
to be allocated for ORs. We consider a fully asymmet- 
ric network with homogeneous unconditional and condi- 
tional failure probabilities. First, for a non-interacting 
environment, the loss frequency distribution is binomial 
with daily failure probability, m = (/){'&)■ Using the con- 
struction as described afore, the loss distribution was 
evaluated and VaRo calculated for a confidence level 
q = 99.99%. Next, for the interacting environment, the 
loss frequency distribution is once more binomial. How- 
ever the daily failure probability, to is now given by the 
stationary operational solution to Eq (|16p . as a function 
of Jq. The loss distributions are evaluated and the VaR 
determined for the same confidence level. In Fig. ([5]) we 
plot the ratio VaR to VaRo as a function of Jq. A signif- 
icant increase in the VaR is clearly discernible. This is 
mainly driven by interactions induced through increas- 
ing Jq, which enlarges the daily failure probability, to. 
As Fig. ^ illustrates, for a company with a high de- 
gree of interdependence between processes, VaRo does 
not capture the true extent of risk. 
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FIG. 5: Ratio of VaR to VaRo, as a function of Jo. We 
consider a network of 50 processes with J = 0.2, over a risk 
horizon of 365 days. The data is fitted against a quadratic 
curve, represents by the dashed line. 



V. CONCLUSION 

In this paper we investigated networks of interacting 
and functionally correlated processes as models of OR in 
large organizations. In the limit of sparse connectivity, 
we studied the dynamical evolution and stationary states 
of these networks using analytical methods and numeri- 
cal simulations. A heuristic approach was used to exactly 
solve the dynamics of fully asymmetric networks. GFA 
and EO simulations were used to study the dynamics of 
partially and fully symmetric networks. Finally, station- 
ary states for fully symmetric networks were analyzed 
using techniques from equilibrium statistical mechanics. 

The initial variant of our model was formulated on a 
sparsely connected random graph, for which the aver- 
age connectivity per process c, satisfies, c/N ^ in the 
thermodynamic limit. However, it should be noted that 
the assumption needed to carry through the GFA as de- 
scribed in this paper is c ^ 1 rather than sparseness. 
Hence, the analysis applies to networks with non-sparse 
connectivity with c = 0(N) as well. 

From all investigated cases, we found that there exists 
a range of interactions, characterized by Jo; for which an 
operational phase coexists with a non-operational phase. 
This raises the possibility of spontaneous catastrophic 
breakdown in such process networks. We demonstrated 
this feature is robust and invariant under a broad range 
of changes in microscopic details. 

Phase diagrams were produced for fully asymmetric 
networks using the exact dynamical theory and for fully 
symmetric networks using the results from equilibrium 
statistical mechanics. Similar behavior is observed in 
both cases. For sufhciently small values of a-priori failure 
probability p, the operational and non-operational phases 
are distinct; a discontinuous transition between phases 
can be induced, for instance, by changing Jo. There is 
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a critical failure probability Pc, where the transition is 
continuous. We also note that the two critical values, 
Pc ~ 0.102 for the fully asymmetric case, and pc ~ 0.110 
for the fully symmetric case are very close. 

Given parameter values p, Jo and J for which stable 
operational and non-operational phases coexist, one can 
further qualify these phases as either absolutely stable 
or meta- stable. These notions are well known for ther- 
modynamic equilibrium systems. Therein, the absolutely 
stable phase is the one with the lower value of the free en- 
ergy. Parameter values at which absolutely stable phases 
become meta-stable (or vice-versa) are the locations of 
proper first-order equilibrium phase transitions. 

Of the systems studied in this paper, however, only 
the fully symmetric network with thermal noise can be 
characterized as a thermodynamic equilibrium system. 
Nevertheless, the notion of absolute stability and meta- 
stability of coexisting dynamic stationary states can be 
carried over to the other systems in the following man- 
ner. One denotes by rjv the fraction of the total time a 
system of size N spends in one of the coexisting station- 
ary states, either operational or non-operational. A state 
is characterized as absolutely stable if limjv^oo tat = 1 
, and as meta-stable, if limAr^oo = Q. A meta-stable 
phase can nevertheless be dynamically stable in the sense 
that, once in such a phase, a system will typically stay 
in that phase for a very long time tjq which diverges as 
N ^ CO. 

These concepts have a significant implication for OR, 
that play out for large (but finite) process networks that 
exhibit dynamically and absolutely stable operational 
phases, coexisting with a meta-stable non-operational 
phases. Small changes of the parameters characterizing 
the network, e.g. a slight increase in the average mutual 
dependency among processes — described in our model 
by a slight increase in Jg — could entail that the oper- 
ational phase becomes meta-stable. In this case a catas- 
trophic breakdown is bound to occur, even under normal 
operating conditions. A small increase in Jq would re- 
sult in a correspondingly small change in the statistical 
properties of the network. As such, there would be no vis- 
ible precursors for the transition. In other, more colorful 
words, a small change in the design of a process network 
could amount to inadvertently entering a gate carrying 
the inscription "Lascmte ogni speranza, voi ch'entrate" 

Of particular relevance in this context are a number of 
current trends such as the advance of globalization, in- 
creasing reliance on information technology, or the grow- 
ing modularization of business and production processes, 
including out-sourcing. Within our model these trends 
roughly correspond to a trend of increasing Jq; for a sta- 
ble operational phase this is equivalent to a trend of push- 
ing that phase towards meta-stability, hence eventually 
towards guaranteed catastrophic breakdown. 

A recent incident involving the retail bank HSBC out- 
sourcing business processing highlights this issue [20| . 
The outsourced data included bank account passwords. 



along with other sensitive information to an office in Ban- 
galore, India. A widespread fraud, that took roots in 
2002 and affected one thousand of the banks' clients was 
uncovered earlier this year. While this did not critically 
affect the bank, its vulnerability has increased. The fact 
that sensitive information was out-sourced to a single of- 
fice shifted the banks' stable operational phase further 
towards the meta-stability. 

It is imperative that banks and other organizations as- 
sess their stability, as described above, and check for the 
possible coexistence of operational and non-operational 
phases. As there are no detectable precursors for transi- 
tions between them, the assessment must be performed 
by stress tests, wherein artificial strains and fluctuations 
are introduced into the system and the effects are mea- 
sured. The present model readily lends itself to tests of 
this type. 

In our investigation, we only considered Poisson ran- 
dom graphs with large average connectivity c. As a con- 
sequence, there is little heterogeneity in the local envi- 
ronment about each process. However, it is known that 
realistic process networks exhibit a richer topology, which 
includes the presence of "hubs". In such situations, the 
connectivity scales according to a power-law and the an- 
alytical treatment would follow lines of reasoning used in 
small- world networks [2l|. A similar qualitative behav- 
ior with coexistence of operational and non-operational 
phases is nevertheless expected. 



APPENDIX A: GENERATING FUNCTIONAL 
ANALYSIS 

In this appendix we use Generating Functional Analy- 
sis (GFA) [1^] to formally solve the model dynamics and 
systematically justify the heuristic solution in Sec. IIII Al 
We begin by introducing the generating functional over 
source fields, ip, 



Z[xf,] = /exp(-i^^ V»(i)r/.(i)] 

\ V 4=1 t = ) 



(Al) 



The angled brackets refer to the average over all "paths" 
which are trajectories of microscopic states. Explicitly, 



Z{^\ = ^ F[{,7(0}]cxp -i5]^V.(0^.(i) (A2) 

{r7(t)} \ t=0 i=l / 

where ^'[{^7(^)1] = P[t7(0), . . . ,r7(T)] denotes the prob- 
ability of all paths over the risk horizon, T . The generat- 
ing functional can be used to compute expectation values 
and correlation functions as 



. dZ[ip] 



dMt) 



il>=0 



d^P,is)di>,it) 



(A3) 
(A4) 



tp=0 
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We are interested in evaluating the generating func- 
tional for a typical realization of disorder. This is 
achieved by averaging Eq. (jA2| over and Xij. To 
proceed, we exploit that the path probability measure 
has a Markov structure. 

T-l 

p[{ri{m - p{ m ) n ^( + 1) I ^ w ) ' (A5) 

t=o 

with transition probabilities 

N 

P{riit + l)\qit)) = Yl / VmS[,,it+i),Mt)]- (^6) 
The ^i(t) are unit mean and zero variance Gaussian ran- 



dom variables. Furthermore, 



(A7) 



1. Average Over Fast Noise 



We conduct the ^i{t) integral and disorder aver- 
age by first extracting these contributions Ui{t) — 
'^jCijJij'Hj^t) + from the Heaviside function in 

fi{t). This is facilitated utilizing a Dirac 5- function and 
its normalization property to give us 



T-l N 



V(0) V{T) 



z[rp] 5]...^p(,7(o)) nn / 



2tt 

T N 



exp -iui{t)i Ui{t)- y^Cij Jijiljit) 



exp ( -iJIXl MtH{t) 
I 



t=0 i=l 



(A8) 



In equation (jA8[) . we redefined fi{t) to denote, 

Mt) = e {u,{t) + K{t) ^ d,) . 



(A9) 



The "hatted" conjugate terms, Ui{t), are a consequence 
of a Fourier representation of the (5-function. 



2. Disorder Average 



The disorder average, which factorize in pairs, affects 
; and Xij . y\ 
ing definition 



Cij and Xij . We localize the terms involved in the follow- 



i<j \ t=Q \ 



iui{t) Ji-j r]j(t) 



+ iuj{t) Jji rii{t) 



(AlO) 



keeping the dominant terms, we obtain 

T-l 



^c,„x,, ^ exp 



Jo k{t) m{t) 



t=o 



T-l 



s,t=0 \ 

+ aG{s,t)G{t,s) 



(All) 



which depends on a set of macroscopic variables defined 
as 

N 



i=l 
1 ^ 

4 = 1 
1 ^ 

i=l 
1 ^ 

(9(s,t) = — XiMj(s)iuj(t) 

i=l 
1 ^ 

G(i,s) = — XiMj(s)77j(t). 



i=l 



We first perform the Cy average, and proceed by taking 
the Taylor expansion of the exponential in the limit c 3> 

1. Next, taking the Xij average, re-exponentiating and Z[ip] ' , one proceeds as usual by enforcing the above 



To^ achieve site factorization in the evaluation of 
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definitions of the macroscopic variables via Dirac S- 
function identities and tfieir Fourier representations. 
This subsequently generates a set of conjugate variables 
to the above set of macroscopic variables. The averaged 
generating functional is expressed in the following com- 
pact form 



= J V{...}exp{N[Ei +E2 + Es]} 



(A12) 

Here, T>{. . .} represents taking the integral over all the 
macroscopic order parameters, and their conjugates. The 
functions Si, S2 and S3, appearing in the exponential of 
Eq. (|A12[) . are defined as 



T-l 



Jo^fcWm(t) + yJ2 [Qis,t)qis,t) 

t=0 s,t=0 \ 

+ aG{t,s)G{s,t) , 

T-l 



(A13) 



S2 = i^ (m{t)m{t) + k{t)k{t) 



T-l 



+ 1^^ U(s,t)q(s,t) + Q(s,i)Q(s,i) 

s,t=0 \ 

+ G{t,s)G{t,s) \ , 



(A14) 



--3 — 



1 ^ { 

-^log p('?(o)) 

1=1 I {,,(*)} 

T-l r 

n 



du{t) du{t) 
'2^ 



exp (^-S - Vit)^ I 



(A15) 



Here S denotes the dynamic action, 

T-l / 



S = 



T-l 



/ s,t=0 \ 

-iQ{s,t)iui{t)mi{s) - iG{t,s)iui(s)r]i{t) 



Thus, we have transformed the generating functional 
into an integral with leading order in the exponential of 
N, which can b evaluated using the saddle point tech- 
nique. The contribution S3, Eq. (|A15P describes an en- 
semble of independent dynamical processes. 

3. Saddle Point Equations 



At the saddle point, the macroscopic observables of 
interest resolve to 



m{t) 
q{s,t) 
G{s,t) 



1=1 
1 ^ 

i=l 
1 ^ 



(A16) 



The represent averages over the dynamics of 

effective single site processes and have the form 



((••■)>W = 





r du(t) du(t) c 
[J 2rr (>Mt+l),Mt)]\ 


(...)e-^) 




r du(t) du(t) c 
[J 27r ^Mt+l)Jdt)]\ 







(A17) 



Averages involving a conjugate field iu(t) describe re- 
sponse functions, i.e., perturbations of expectations with 
respect to an external field. Thus, averages involving only 
conjugate fields describe a perturbation of a constant and 
are set to zero. Furthermore, by causality, G{t, s), which 
describes the response of the fraction of failed processes 
at time t, to a perturbation at time s, vanishes W s > t. 
Using the saddle point identities for other order parame- 
ters and their conjugates, we obtain Si = and S2 = 



at the saddle point. The dynamic action reduces to 



T-l 



s 



t=o 



Y - Jo Mt) 

-0 \ 

s<t J 

1 

- Y iu{t)m{s){j\{s,t) +Sis,t]) ,(A18) 



s,t=0 
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and it corresponds to single site processes, with dynamics 
77(t+l) = e I Jo m(t) + a j2 ^ G(f, s) 77(5) 

\ s<t 



equilibrium statistical mechanics can be employed to 
study the stationary behavior. The resulting Gibbs- 
Boltzmann equilibrium distribution for observing a given 
microscopic state is characterized by the Hamiltonian 



- 1? + 4>{t) + h{t) 



(A19) 



We note the following: (i) There is now a dependence 
on the fraction m{t) of failed processes, (ii) If there 
is a degree of symmetry, a 7^ 0, the dynamics is non- 
Markovian, with memory given by the response function 
G{t,s). (iii) The noise (f>{t) is colored, with {(f>{t)) = 
and (0(s)0(O> = J'g(s,t) +S[s,t]- 

We recall that the only i dependence in Eqs. (jA16P - 
(IA17P comes from -di in fi(t), Eq. By the law of 

large numbers the empirical averages in Eq. (|A16|) can 
be evaluated as an average over the distribution, 

^.i™^^E((- ••))(.) = fd^Pim---)) = (ToT 

The saddle point results, Eq. (|A16p thus take the form 



(B2) 



We obtain the stationary macroscopic order parame- 
ter, m, describing the fraction of failed processes from the 
free energy per process. In the limit, N — s- 00, the free 
energy is expected to be self averaging over the disorder. 



T 



lim log(Z) 



(B3) 



The above quenched average is calculated using the 
"replica trick" , 



/= lim lim —log'^'"''". (B4) 



m{t) = (rjit)) 



q{s,t) = (77(5) , 
dm{t) 



G{t, s) 



dh{s) 



(A20) 
(A21) 

(A22) 



In the case a = 0, there is no memory effect. Eq. 
(|A20[) is by itself sufficient to describe the dynamics. 
Evaluating the average we get 



m(t + 1) = $ 



Jo ■m{t) - -d 
+ J2 rn{t) 



(A23) 



In a manner similar to that presented in appendix IA 21 
we take the average over Cij and Xij. The resulting ex- 
pression for the free energy is, 



T 



lim lim log > 

expl iV| ^^m^ 



0^ j2 



(B5) 



However, for a 7^ 0, memory G(t, s) is relevant. The 
non-linear nature of the evolution Eqs. (|A19P - (|A20p pre- 
cludes a simple analytical characterization of the long- 
time asymptotic stationary states. 



APPENDIX B: EQUILIBRIUM STATISTICAL 
MECHANICS 



where, 



AT 



TV ^ 

i=l 



1 ^ 

N ^ 



mm 



(B6) 



For fully symmetric networks, a = 1, and thermal 
noise, ^, distributed as 



p{i) = -/3sech2 



(Bl) 



The indices 1 < /i, < n label the replicas. The above 
parameters were introduced to achieve site factorization, 
in a manner identical to that in appendix lA 21 Subse- 
quently, the free energy is expressed as an integral, which 
is evaluated via the saddle point technique. 
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f = lim lim — ^ log / T>(. . .) exp [ 
•' n^O JV^oo nN J ^ ^ ^ I 



^ X! + ^~T~ XI + i ^ ma TOa + i X Jae ^ae (B7) 



log ^ exp (-/3 iJeff ) 



Here, the effective Hamiltonian is given by 

a a 

-iX$aer?"r?^ (B8) 

a, e 

1. Saddle Point Equations 

We obtain the following expressions for the order pa- 
rameters, 



ma = 



where, 



((•••)) 



E{,,M}(---) 0Xp(-/3i?eff) 



(B9) 



(BIO) 



E{^A.} exp(-/3i?eff) 
A solution of the order parameters is achieved with the 



following replica symmetric ansatz, 

ma = m, Va (Bll) 

Qae = mS[a,e] + 9 (1 " S[a,e]), V a, £ (B12) 

On application of the ansatz and Gaussian lineariza- 
tion of quadratic replica t(Tms, wc' obtain 



m = j (/iRs) , 



q = Jvz^ (/iRs) , 
where, $/3(a;) = i ^1 + tanh and 



(B13) 
(B14) 



gj2 

hRs = -'& + Jom + J^z + ^{m-q). (B15) 
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